# Low-latency Speech Interaction
Ultravox V0 4 1 Llama 3 1 70b
MIT
Ultravox is a multimodal speech large language model, built upon the pre-trained Llama3.1-70B-Instruct and whisper-large-v3-turbo backbones, capable of receiving both speech and text as inputs.
Text-to-Audio
Transformers Supports Multiple Languages

U
fixie-ai
204
24
Ultravox V0 4 1 Llama 3 1 8b
MIT
Ultravox is a multimodal speech large language model built on Llama3.1-8B-Instruct and whisper-large-v3-turbo, capable of processing both speech and text inputs.
Audio-to-Text
Transformers Supports Multiple Languages

U
fixie-ai
747
97
Featured Recommended AI Models